Privacy Preserving Parallel Clustering Based Anonymization for Big Data Using MapReduce Framework

نویسندگان

چکیده

Big data refers to a massive volume of collected from heterogeneous sources including Internet Things (IoT) devices. analytics is playing crucial role in extracting patterns that would benefit efficient and effective decision making. Processing this poses several critical issues such as scalability, security privacy. To preserve privacy, numerous privacy-preserving mining publishing techniques exist. Data anonymization utilizing for preserving an individual’s privacy promising approach prevent the against identity disclosure. In paper, Parallel Clustering based Anonymization Algorithm (PCAA) proposed, results prove algorithm scalable also achieves better tradeoff between utility. The MapReduce framework used parallelize process handling huge data. performs well terms classification accuracy, F-measure, Kullback–Leibler divergence metrics. Moreover, big generated are efficiently protected meet ever-growing requirements application.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Model Based Framework for Privacy Preserving Clustering Using SOM

Privacy has become an important issue in the progress of data mining techniques. Many laws are being enacted in various countries to protect the privacy of data. This privacy concern has been addressed by developing data mining techniques under a framework called privacy preserving data mining. Presently there are two main approaches popularly used -data perturbation and secure multiparty compu...

متن کامل

An Local-recoding Anonymization with Mapreduce for Scalable Big Data Privacy Preservation in Cloud

Data privacy preservation is one of the most disturbed issues on the current industry. Data privacy issues need to be addressed urgently before data sets are shared on cloud. Data anonymization refers to as hiding complex data for owners of data records. Cloud computing provides promising scalable IT infrastructure to support various processing of a variety of big data applications in sectors s...

متن کامل

Big Data Using Pre-processing Based on Mapreduce Framework

Now a day enormous amount of data is getting explored through Internet of Things (IoT) as technologies are advancing and people uses these technologies in day to day activities, this data is termed as Big Data having its characteristics and challenges. Frequent Itemset Mining algorithms are aimed to disclose frequent itemsets from transactional database but as the dataset size increases, it can...

متن کامل

Privacy-preserving anonymization of set-valued data

In this paper we study the problem of protecting privacy in the publication of set-valued data. Consider a collection of transactional data that contains detailed information about items bought together by individuals. Even after removing all personal characteristics of the buyer, which can serve as links to his identity, the publication of such data is still subject to privacy attacks from adv...

متن کامل

A centralized privacy-preserving framework for online social networks

There are some critical privacy concerns in the current online social networks (OSNs). Users' information is disclosed to different entities that they were not supposed to access. Furthermore, the notion of friendship is inadequate in OSNs since the degree of social relationships between users dynamically changes over the time. Additionally, users may define similar privacy settings for their f...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Applied Artificial Intelligence

سال: 2021

ISSN: ['0883-9514', '1087-6545']

DOI: https://doi.org/10.1080/08839514.2021.1987709